Measurement of the Top Quark Pair Production Cross Section in the All-Jets Decay 
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The standard model predicts that, at Tevatron ener- 
gies, top quarks are produced primarily in ti pairs, and 
that each top quark decays into a b quark and a W bo- 
son. 44% of these events are expected to have both W 
bosons decay into quarks. These pure hadronic, or "all- 
jets", ti events are among the rare collider events with 
several quarks in the final state. With no final state 
energetic neutrinos, the all-jets mode is the most kine- 
matically constrained of the top quark decay channels, 
but is also the most challenging to measure due to the 
large QCD multijet background. This compelled us to 
use unique tools such as quark/gluon jet differences, and 
to make extensive use of neural networks, to separate the 
ti final states from the QCD background B. The com- 
parison of ti cross sections from the all-jets and lepton + 
jets channels allows a search for new phenomena in top 
decays; for example, top decay via a charged Higgs boson 
could be observed as a deficit, relative to the all-jets final 
states, in the ti final states with energetic leptons. 

The signal for these all-jets ti events is at least six re- 
constructed jets. The main background is from QCD 
multijet events that arise from a 2— >2 parton process 
producing two energetic ("hard") leading jets and less 
energetic ("soft") radiated gluon jets. 

The D0 detector is described in Ref. [9. We used 
the same reconstruction algorithms for jets, muons, and 
electrons as those used in previous top quark analyses 
H . The muons in this analysis are used to identify b jets, 
and are restricted to the pseudorapidity range \r]\ < 1.0, 
where r\ — tanh -1 (cos0), and 9 is the polar angle relative 
to the beam axis. 

The multijet data sample was selected using a hard- 
ware trigger and an online filter requiring five jets of cone 
size 1Z = 0.5, pseudorapidity \rj\ < 2.5 and transverse en- 
ergy E T > 10.0 GeV. Here, TZ = ((A^) 2 + (Ar ? ) 2 )i , where 
(f> is the azimuthal angle around the beam axis. Addition- 
ally, we required the total transverse energy of the event 
(Ht) to be > 115 or 120 GeV (depending on run con- 
ditions). The data sample after the initial cuts has ~ 
600,000 events. With about 200 expected top events in 
this channel, the background overwhelms the signal by a 
factor of « 3000. As discrimination from many variables 
was needed to separate signal from background, most of 
which are significantly correlated, we used neural net- 
works (NN) as an integral part of this analysis. 

The offline analysis proceeded by excluding events with 
an isolated muon or electron to maintain a data sample 
independent of the other ti samples. We required events 
to have at least six 1Z=0.3 cone jets and less than nine 
1Z=0.5 cone jets, with jet E T > 8.0 GeV. We gener- 
ally used 7£=0.3 cone jets because of their greater recon- 
struction efficiency, but used 7?.=0.5 cone jets to calculate 
mass-related variables. We required that at least one jet 
have an associated muon which satisfied muon quality 
criteria and which was kinematically consistent with a 



b — > fiX decay within the jet. As about 20% of ti all- 
jets events have such a "^-tagged" jet in the acceptance 
region for ti signal, compared to approximately 3% of 
the QCD multijet background in that region, the tag- 
ging requirement reduces the background-to-signal ratio 
by about an order of magnitude. Of the total 280,000 
events surviving the offline cuts, 3853 have at least one 
/i-tagged jet. These tagged events comprise the data sam- 
ple used to extract the cross section. 

Compared with the QCD multijet background, ti 
events typically have more energetic jets, have the to- 
tal energy more uniformly distributed among the jets, 
are more isotropic, and have their jets distributed at 
smaller rj. To discriminate ti signal from QCD back- 
ground, we defined at least two variables describing each 
of these qualities (total energy, jet energy distribution, 
event shape, and rapidity distribution) JjJ: 

1. Ht'- The sum of the transverse energies of jets. 

2. v^: The invariant mass of the jets in the final state. 

3. E^/Ht'- Et ± is the transverse energy of the lead- 
ing jet. 

4. HJ: Ht without the transverse energy of the two 
leading jets. 

5. N^ ts : The number of jets averaged over a range 
of Et thresholds (15 to 55 GeV), and weighted by 
the Et threshold. This parameterizes the number 
of jets taking their hardness into account. 

6. Et & b '■ The square root of the product of the trans- 
verse energies of the fifth and sixth jets. 

7. A: The aplanarity, calculated from the normalized 
momentum tensor. 

8. S: The sphericity, calculated from the normalized 
momentum tensor. 

9. C: The centrality, C = Ht/He, where He is the 
sum of all the jet total energies. This characterizes 
the transverse energy flow. 

10. <rj 2 >: The ^-weighted mean square of the r\ dis- 
tribution of jets in an event. 

These ten variables are the inputs to the first neural 
network (NN1), whose output is used as an input variable 
for the second neural network (NN2). The three other 
inputs to NN2 are: 

11. p T : The transverse momentum of the tagging 
muon. 

Thep T distribution is harder for tagged jets in ti events 
than for tagged jets in QCD multijet events. 
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FIG. 1. Comparison of the absolute number of data-based 
multijet background (histogram) with the observed 3853 
muon-tagged events in data (o) for (a) Ht (in GeV), (b) 
E^/Ht, (c) sphericity, and (d) jet- width Fisher discriminant. 
Shown also are the distributions in these variables for herwig 
ti events (•). 

12. M.: The mass-likelihood variable. This variable is 
defined as M = [M Wl - M w ) 2 /a 2 mw + (M W2 - 
M w) 2 /v 2 mw + (m tl - m t2 ) 2 / cr m tI with the pa- 
rameters Mw, £f mw , and a mt set to 80, 16 and 62 
GeV/c 2 , respectively. Mw ( and refer to the jet 
combinations that best define the W boson and top 
quark masses in an event. 

The mass likelihood variable M. is a x 2 -bke quantity, 
minimized when there are two invariant masses consistent 
with the W mass, and two candidate top quark masses 
that are identical. o 2 nw and <r;L were determined from 
simple two and three jet combinations using D0 jet res- 
olutions. We did not assume that the muon tagged jet 
came from a b quark. 

13. T: The jet-width Fisher discriminant. This is de- 
fined as J^jct = O jet - Crquark(^T)) 2 / cr quark(- B T) " 
l^jet ^gluon 

(£' T )) 2 /cr 2 luon (i; T ), where o\^ v {E T ) 
and cr 2 luon (i? T ) are mean square jet widths calcu- 
lated from herwig H Monte Carlo, for quarks and 
gluons respectively, as functions of jet Ex- 
it has been demonstrated that quark jets are, on av- 
erage, narrower than gluon jets ^,0]. The Fisher dis- 
criminant, based on the r)-(j) RMS jet widths, is calculated 
for the four narrowest jets in the event, and indicates 
whether the jets were most probably "quark-like" (ti) or 
"gluon-like" (QCD multijet). 

Figure |l| shows a comparison of distributions from the 
modeled background discussed below, the data, and HER- 
WIG ti events for four of the above variables. 

The top quark production cross section is calculated 
from the output of NN2. Both networks were trained to 
force their output near 1 for ti events, and near for 
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Neural Net Output 

FIG. 2. The distribution in NN2 output (on a log scale) for 
data (diamonds + error bars) and the fits for expected signal 
and background. The signal was modeled using herwig for 
m t = 180 GeV/c 2 . The errors shown are statistical. 

QCD multijet events, using the back-propagation learn- 
ing algorithm in jetnet JjJ. 

The very large background-to-signal ratio in the un- 
tagged data indicates an almost pure background sam- 
ple. With a correction for the very small ti component 
expected, and with a method of assigning a muon tag 
to the untagged event, the background estimate can be 
determined directly from the data. Separate sets of un- 
tagged data with added muon tags were used for network 
background response training and background modeling. 
herwig ti events were used for the ti network signal re- 
sponse training. 

The correct assignment of muon tags to the untagged 
data was critical to our background model. We derived 
a "tag rate function" from the entire multijet data set, 
defined as the probability for any individual jet to have a 
tagging muon. We chose a function that factorized into 
two pieces: e, the detector efficiency dependent on r\ of 
the jet and the run number of the event (to account for 
chamber aging), and /(Et), the probability that a jet 
of tranverse energy Et has a tagging muon. We studied 
two parametrizations of /(Et), and used the difference to 
estimate the systematic error from this source. Finally, a 
small dependence of the tag rate on \/i of the event was 
found, which was incorporated into /(Et)- A detailed 
discussion of the tag rate function is given in Ref. |l| . 

We established that the px of the tagging muon and 
the Et of the tagged jet (uncorrected for the muon and 
neutrino energy) are uncorrelated. Therefore, the muon 
Pt factors out of the tag rate function, and can be gen- 
erated independently. By applying the tag rate function 
to each jet in the untagged data sample, and generat- 
ing a muon pt for those jets determined as tagged, we 
produced the background model sample. 

The NN2 output distributions for data, modeled back- 
ground and herwig ti signal are plotted in Fig. |2[ We 
excluded events in the region of NN2 output < 0.02. Jets 
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FIG. 3. The (a) background normalization and (b) ti cross 
section from fits to the NN2 output distribution as data points 
in Fig. ^ are removed at low NN2 output values. The hori- 
zontal lines are the calculated normalization and cross section, 
respectively, from Figure ^. Error bars are statistical, but are 
correlated through the error matrix. 

in that region tend to have low Et, where the tag rate 
is not well determined due to the low tagging probability 
(low statistics), and consequently, the background mod- 
eling may be less accurate. The cross section is obtained 
from a simultaneous fit of the data to the background and 
herwig tt shapes, with the background normalization 
(Abkg) and the ti cross section (a t t) as free parameters. 
The result of this fit is also shown in Fig. ||. 

The stability of our result can be checked by succes- 
sively eliminating data points at the lowest values of the 
NN2 output. Figure H shows the values of the background 
normalization and tt cross section as the data points are 
removed and the remaining points are refitted. The refit- 
ted cross sections are independent of NN2 output region, 
confirming that the initial NN2 output cut at 0.02, and 
choosing the region NN2 output > 0.1 for our final cross 
section calculation, does not bias the result. Because of 
the preponderance of background at the low end of NN2 
and the stability of our fits, we use the region NN2 > 0.1 
for our quoted cross section results. 

Values of the cross section and background normaliza- 
tion are obtained from similar fits with herwig tt events 
generated at different top quark masses. The results are 
shown in Table I. Interpolating to the top quark mass 
as measured by D0 f§] (m t = 172.1 GeV/c 2 ), we ob- 
tain a t i = 7.1 ± 2.8 (stat.) ± 1.5 (syst.) pb, consistent 
with a previous measurement in this channel Q , and the 
most precise value for this channel to date. Table II sum- 
marizes the contributions to the systematic error on the 
cross section. These were determined by varying each 
source by its uncertainty, and calculating the difference 
in the cross section. 

As a check, we calculated the cross section from the 
excess events over expected background, using the efH- 



TABLE I. Results of the fits to neural network output. 



Top Quark 


A 

^T-bkg 




X 1 d.o.t. 


Mass (GeV/c 2 ) 




(Pb) 




140 


1.05 ± 0.03 


18.4 ± 7.8 


17.6 / 17 


160 


1.06 ± 0.03 


9.3 ± 3.8 


17.2 / 17 


170 


1.07 ± 0.02 


7.2 ± 3.0 


17.1 / 17 


180 


1.07 ± 0.03 


6.3 ± 2.5 


16.9 / 17 


200 


1.07 ± 0.03 


5.1 ± 2.0 


16.8 / 17 


220 


1.07 ± 0.03 


4.4 ± 1.7 


16.7 / 17 



ciency of the criteria for tt selection (calculated using 
herwig), along with the branching ratio and the mea- 
sured luminosity. For NN2 > 0.85 (chosen to minimize 
the error on the cross section) we observed 41 events with 
24.8 ± 2.4 expected background events for an excess of 
16.2 events. The excess corresponds to a ti cross section 
of 7.3 ± 3.3 ± 1.6 pb at m t = 172.1 GeV/c 2 , consistent 
with our result above. 

The significance of the excess is characterized by the 
probability P of the observed number of events being 
due to fluctuation. For an NN2 output threshold of » 
0.94, where Monte Carlo studies predict maximal ex- 
pected significance, we observe 18 events where 6.9 ± 0.9 
background events are expected, for which P = 0.0006, 
corresponding to a 3.2 standard deviation effect. This is 
sufficient to establish the existence of a it signal in the 
all-jets final state. 

To further check the validity of the tag rate function 
and hence the background model, we looked at events 
with more than one tagged jet. The modeled back- 
ground here consists of those untagged events that had 
two jets tagged by application of the tag rate function. 
We assumed that the fraction of the double-tagged events 
from correlated sources, such as bb production, is con- 
stant over the NN2 output, but refitted the background 
normalization for a possible overall correlation. A total 
of 32 double-tagged events are observed for NN2 out- 
put > 0.02 where 28.7 ± 8.2 events are expected from 
background. Two events are observed for NN2 output 
> 0.85 with 0.7 ±0.1 expected background events, and 
1.2 top events expected from Monte Carlo. The small 
excess in the double-tagged sample is consistent with our 
conclusion that the more significant excess in the singly- 
tagged sample is from tt production. 

Previous D0 measurements of ti production in the 
dilepton and single lepton channels ]l0| give an aver- 
age cross section of 5.6 ± 1.4 (stat.) ± 1.2 (syst.) pb 
at m t = 172.1 GeV/c 2 , in very good agreement with that 
from the all-jets channel. We combine the all-jets cross 
section with these results, assuming the statistical errors 
are uncorrelated, and that the systematic errors have 
the appropriate correlation coefficients. The combined 
D0 result for the it production cross section is 5.9 ± 
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TABLE II. Summary of statistical and systematic uncer- 
tainties for the cross section. 



Background Source 


Size of Uncertainty 


Statistical error 


4 % 


Functional Form of the Muon-Tag Rate 


7 % 


Background Correction for tt Signal 


6 % 


Background Et scale 


9 % 


Signal Source 


Size of Uncertainty 


Statistical Error 


3 % 


Trigger Turn-on 


5 % 


Luminosity Error 


5 % 


Jet Energy Scale 


6 % 


tt Tag Rate 


7% 


Model Dependence 


6 % 


b — ► fi Branching Fraction 


6 % 


muon pT Dependence 


7% 


T Dependence 
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